Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement
نویسندگان
چکیده
End-to-end speech-to-text translation (E2E-ST) is becoming increasingly popular due to the potential of its less error propagation, lower latency, and fewer parameters. Given triplet training corpus〈speech, transcription, translation〉, conventional high-quality E2E-ST system leverages the〈speech, transcription〉pair pre-train model then utilizes translation〉pair optimize it further. However, this process only involves two-tuple data at each stage, loose coupling fails fully exploit association between data. In paper, we attempt joint probability transcription based on speech input directly leverage such Based that, propose a novel regularization method for improve agreement dual-path decomposition within data, which should be equal in theory. To achieve goal, introduce two Kullback-Leibler divergence terms into objective reduce mismatch output probabilities dual-path. Then well-trained can naturally transformed as models by pre-defined early stop tag. Experiments MuST-C benchmark demonstrate that our proposed approach significantly outperforms state-of-the-art baselines all 8 language pairs while achieving better performance automatic recognition task.
منابع مشابه
End-to-End Automatic Speech Translation of Audiobooks
We investigate end-to-end speech-to-text translation on a corpus of audiobooks specifically augmented for this task. Previous works investigated the extreme case where source language transcription is not available during learning nor decoding, but we also study a midway case where source language transcription is available at training time only. In this case, a single model is trained to decod...
متن کاملEnd-to-End Evaluation in JANUS: A Speech-to-speech Translation System
JANUS is a multi-lingual speech-to-speech translation system designed to facilitate communication between two parties engaged in a spontaneousconversation in a limited domain. In this paper we describe our methodology for evaluating translation performance. Our current focus is on end-to-end evaluations the evaluation of the translation capabilities of the system as a whole. The main goal of ou...
متن کاملAn Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation
This paper describes the evaluation methodology used to evaluate the TC-STAR speech-to-speech translation (SST) system and their results from the third year of the project. It follows the results presented in (Hamon et al., 2007), dealing with the first end-to-end evaluation of the project. In this paper, we try to experiment with the methodology and the protocol during the second end-to-end ev...
متن کاملComparison of nerve repair with end to end, end to side with window and end to side without window methods in lower extremity of rat
Abstract Background : Although, different studies on end-to-side nerve repair, results are controversial. The importance of this method in case is unavailability of proximal nerve. In this method, donor nerves also remain intact and without injury. In compare to other classic procedures, end-to-side repair is not much time consuming and needs less dissection. Overall, the previous studies i...
متن کاملEnd-to-End Evaluation of a Speech-to-Speech Translation System in TC-STAR
The paper describes an evaluation methodology to evaluate speech-to-speech translation systems and their results. The evaluation scheme uses questionnaires filled in by human judges for addressing the adequacy and fluency of audio translation outputs and was applied in the second TC-STAR evaluation campaign. The same evaluation methodology is carried out both on the outputs of an automatic syst...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i10.21303